AITopics | navigation step

2e5c2cb8d13e8fba78d95211440ba326-Supplemental.pdf

Neural Information Processing Systems | Feb-8-2026, 01:59:01 GMT

Finally, Section E illustrates qualitative results. We present the encoder-decoder variant of HAMT in fine-tuning on the right of Figure 1. Compared to the original cross-modal transformer on the left, the variant removes text-to-vision cross-modal attention. The encoder encodes the texts to obtain textual embeddings. The original target location is viewed as a middle stop point.

artificial intelligence, instruction, predicted trajectory by hamt, (13 more...)

Technology: Information Technology > Artificial Intelligence (1.00)

Narrowing the Gap between Vision and Action in Navigation

Zhang, Yue, Kordjamshidi, Parisa

arXiv.org Artificial Intelligence | Aug-19-2024

The existing methods for Vision and Language Navigation in the Continuous Environment (VLN-CE) commonly incorporate a waypoint predictor to discretize the environment. This simplifies the navigation actions into a view selection task and improves navigation performance significantly compared to direct training using low-level actions. However, VLN-CE agents are still far from real robots, since there are gaps between their visual perception and executed actions. First, VLN-CE agents that discretize the visual environment are primarily trained with high-level view selection, which causes them to ignore crucial spatial reasoning within the low-level action movements. Second, in these models, the existing waypoint predictors neglect object semantics and their attributes related to passability, which can be informative in indicating the feasibility of actions. To address these two issues, we introduce a low-level action decoder jointly trained with high-level action prediction, enabling the current VLN agent to learn and ground the selected visual view to the low-level controls. Moreover, we enhance the current waypoint predictor by utilizing visual representations containing rich semantic information and explicitly masking obstacles based on humans' prior knowledge about the feasibility of actions. Empirically, our agent improves navigation performance metrics compared to strong baselines on both high-level and low-level actions.
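The obstacle-masking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' waypoint predictor: the abstract does not specify the score layout or the masking rule, so the grid of (heading, distance) scores, the boolean `blocked` grid, and the function name `pick_waypoint` are all assumptions made for illustration. The sketch only shows the general pattern of excluding infeasible candidates before selecting the highest-scoring waypoint.

```python
from typing import List, Tuple


def pick_waypoint(scores: List[List[float]],
                  blocked: List[List[bool]]) -> Tuple[int, int]:
    """Return (heading_idx, distance_idx) of the best unblocked candidate.

    scores[h][d]  : hypothetical predictor score for heading bin h, distance bin d
    blocked[h][d] : True if prior knowledge marks that cell as an obstacle
    """
    best = None
    best_score = 0.0
    for h, row in enumerate(scores):
        for d, score in enumerate(row):
            if blocked[h][d]:
                continue  # masked cells can never be selected
            if best is None or score > best_score:
                best, best_score = (h, d), score
    if best is None:
        raise ValueError("all candidate waypoints are blocked")
    return best


# Toy example: the highest raw score (0.9) sits on a blocked cell,
# so the next-best feasible cell is chosen instead.
toy_scores = [[0.1, 0.9],
              [0.4, 0.3]]
toy_blocked = [[False, True],
               [False, False]]
chosen = pick_waypoint(toy_scores, toy_blocked)
```

In this toy grid the unmasked argmax would be cell (0, 1); with the mask applied, the selection falls to cell (1, 0).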

agent, navigation, waypoint predictor, (13 more...)

2408.10388

Country:

Oceania > Australia > Victoria > Melbourne (0.05)
North America > United States > Michigan > Ingham County > Lansing (0.04)
North America > United States > Michigan > Ingham County > East Lansing (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Predictably Smart

#artificialintelligence | Apr-17-2018, 05:46:15 GMT

It's easy to feel like new ML technologies require us to rethink everything about UX design, but that's not quite true. The emergence of ML doesn't change the fact that the most usable, delightful UIs are those that embody principles of good design--like habituation--that many designers and researchers (Don Norman, Jakob Nielsen, Steve Krug, and Jeff Johnson to name a few) have been writing about for years. Evaluating recommendations or visually searching the interface for content counts as a navigation step, just like a tap or click. No ML-based suggestion will be "helpful" enough to offset breaking your user's flow state and muscle memory. But if you're confident that the user has a more open-ended goal like exploration, you have more leeway to put dynamic, ML-based features at the forefront of your UI.

assistance, ml assistance, navigation step, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.97)

Multi-Select Faceted Navigation Based on Minimum Description Length Principle

He, Chao (Chinese Academy of Sciences) | Cheng, Xueqi (Chinese Academy of Sciences) | Guo, Jiafeng (Chinese Academy of Sciences) | Shen, Huawei (Chinese Academy of Sciences)

AAAI Conferences | Jul-19-2011

Faceted navigation can effectively reduce the effort users spend reaching targeted resources in databases by suggesting dynamic facet values for iterative query refinement. A key issue is minimizing the navigation cost in a user query session. The conventional navigation scheme assumes that, at each step, users select only one suggested value to retrieve the resources containing it. To make faceted navigation more flexible and effective, this paper introduces a multi-select scheme in which multiple suggested values can be selected at one step, and a selected value can be used to either retain or exclude the resources containing it. Previous algorithms for cost-driven value suggestion do not work well under this navigation scheme. Therefore, we propose to optimize the navigation cost using the Minimum Description Length principle, which balances the number of navigation steps against the number of suggested values per step under the new scheme. An empirical study demonstrates that our approach is more cost-saving and efficient than state-of-the-art approaches.
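A single step of the multi-select scheme described above might look like the following minimal sketch. It covers only the retain/exclude refinement, not the MDL-based value suggestion, which the abstract does not detail. The function name `refine`, the toy catalog, and the choice to keep a resource when it contains at least one retained value (rather than all of them) are assumptions for illustration.

```python
from typing import Dict, Iterable, Set

# Hypothetical toy representation: resource id -> facet values it carries.
Resources = Dict[str, Set[str]]


def refine(resources: Resources,
           retain: Iterable[str] = (),
           exclude: Iterable[str] = ()) -> Resources:
    """Apply one multi-select navigation step.

    A resource survives if it contains at least one retained value
    (when any are given) and none of the excluded values.
    """
    retain_set, exclude_set = set(retain), set(exclude)
    result: Resources = {}
    for rid, values in resources.items():
        if retain_set and not (values & retain_set):
            continue  # carries none of the values the user chose to retain
        if values & exclude_set:
            continue  # carries a value the user chose to exclude
        result[rid] = values
    return result


catalog: Resources = {
    "r1": {"color:red", "size:s"},
    "r2": {"color:blue", "size:s"},
    "r3": {"color:red", "size:l"},
    "r4": {"color:green", "size:m"},
}

# One step: retain red or blue items, exclude large ones.
step = refine(catalog,
              retain=["color:red", "color:blue"],
              exclude=["size:l"])
```

Here one step with three selected values narrows four resources to `r1` and `r2`; under a single-select scheme the same narrowing would take several steps, which is the trade-off between step count and values per step that the abstract says the MDL objective balances.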

navigation cost, navigation scheme, suggested value, (16 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country: Asia > China > Beijing > Beijing (0.04)

Genre:

Workflow (0.49)
Research Report > Promising Solution (0.34)

Industry: Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory > Minimum Complexity Machines (0.71)